■y^ versus median statistics in SNIa data analysis 
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In this paper we compare the performances of the x^ and median likelihood analysis in the de- 
termination of cosmological constraints using type la supernovae data. We perform a statistical 
analysis using the 307 supernovae of the Union 2 compilation of the Supernova Cosmology Project 
and find that the x^ statistical analysis yields tighter cosmological constraints than the median 
statistic if only supernovae data is taken into account. We also show that when additional measure- 
ments from the Cosmic Microwave Background and Baryonic Acoustic Oscillations are considered, 
the combined cosmological constraints are not strongly dependent on whether one applies the x^ 
statistic or the median statistic to the supernovae data. This indicates that, when complementary 
information from other cosmological probes is taken into account, the performances of the x^ a^nd 
median statistics are very similar, demonstrating the robustness of the statistical analysis. 



I. INTRODUCTION 

More than a decade ago, type la supernovae (SNIa) 
provided the first clear evidence in favor of cosmic accel- 
eration [ilQ- Since then, the availability of ever larger, 
higher-quality SNIa datasets, as well as measurements 
using other cosmological probes, such as the Cosmic Mi- 
crovifave Background (CMB) or the Baryonic Acoustic 
Oscillations (BAO), have been providing overwhelming 
evidence for the existence of dark energy fsL's'], a fluid 
with large negative pressure capable of driving the accel- 
eration of the universe. 

In the so-called ACDM model, also known as the con- 
cordance model, the dark energy role is played by a cos- 
mological constant A, responsible for approximately 73% 
of the energy density of the universe at the present day. 
The remaining percentage is mainly in the form of cold 
matter, most of which non-baryonic and dark. Radia- 
tion is residual at the present time and the universe is 
spatially flat. Despite the good agreement with obser- 
vational data, this model has little appeal on theoreti- 
cal grounds since the value of A required to explain the 
observed cosmic acceleration is off by ^ 120 orders of 
magnitude from the standard quantum field theory pre- 
diction. The coincidence between our observing time and 
the time of the onset of cosmic acceleration is also puz- 
zling [a, 0] . Many other dark energy models have been 
proposed, with the most influential being the ones based 
on dynamical scalar fields. In these models, the energy 
density varies with time and suitable choices of the scalar 
field lagrangian can relax some of the problems associated 
with the cosmological constant (see, for instance, the re- 
view [Sj). As a result, an important step towards a better 
understanding of the dark energy involves further testing 
of its possible dynamical nature. 



The determination of dark energy constraints from ob- 
servational data requires a robust statistical data analy- 
sis. In the particular case of SNIa, the usual procedure is 
to carry out a standard x^ likelihood analysis. However, 
in [3, [l3| it has been argued that the median statistic 
could be a more reliable alternative. The use of the me- 
dian, despite having the drawback of not being as con- 
training as the x^ analysis, has the strong advantage of 
requiring weaker assumptions about the data, thus yield- 
ing more trustworthy constraints. Moreover, the median 
is less vulnerable to the presence of "outliers", which is 
also a significant advantage given current uncertainties 
about the physics of SNIa. 

In this work we revisit and extend the analysis of [lOl | , 
updating it using the recent Union 2 SNIa compilation of 
the Supernova Cosmology Project (SCP) [J|. We com- 
pare the performances of x^ and median statistics in 
the determination of cosmological constraints using SNIa 
data, considering also CMB and BAO measurements. 
The layout of this paper is as follows. In Sec. II we 
describe the application of x^ and median statistics to 
SNIa data. We also discuss the use of additional infor- 
mation contained in the CMB and BAO. In Sec. Ill we 
present and discuss our results. Finally, we conclude in 
Sec. IV. 



II. COSMOLOGICAL PROBES 

A. SNIa 

Type la supernovae appear to be good "standard can- 
dles" and therefore they can serve as useful distance in- 
dicators. Constraints arise by comparing the theoretical 
distance modulus. 
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with the observational distance modulus fiobs inferred 
from the data, at the measured supernovae redshifts. In 



Eq. (H)), dL{z) is the luminosity distance given by, 



2. Median analysis 
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where z is the cosmological redshift, Hq is the present day 
value of the Hubble expansion rate and E{z) = H(z)/Hf). 
The function S is defined as, 



snix, 
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with the flat case {ilko ~ 0) being recovered taking the 
limit V.ko ^- in Eq. 1^. In this paper we consider. 



E{a) = V^rnoa-^ + f^deoa-3(i+'^) + n 
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where a — 1/(1 + z) is the scale factor, n„ia and Qdeo 
are, respectively, the present day values of the matter 
and dark energy fractional densities (fli = Pi/ Pc, with pc 
being the critical density) and ilko = 1 — ^mo — ^rfeo- 
The parameter w is the equation of state parameter of 
the dark energy (the ratio between the pressure and the 
energy density) which, for simplicity, we assume to be 
constant (see however [III, [T^l)- If the dark energy role is 
played by a cosmological constant A then fide = ^A and 
w = —1. 



1. X analysis 

In the x^ statistical analysis, the likelihood P of the 
cosmological parameters Q is given by P ex exp (— x^/2), 
with 



v^ f P-obs{zi) - fith{zi,Q) \ .g^ 



In Eq. ([S]) , iV is the number of supernovae in the dataset 
and CTi is the observational error associated with pobs at 
the redshift Zi. The x^ analysis assumes that: (i) the ex- 
perimental results are statistically independent; (ii) there 
are no systematic errors; (iii) the statistical errors follow 
a Gaussian distribution; (iv) the standard deviation of 
the statistical errors is equal to the observational uncer- 
tainty. 

Presently, there is no strong evidence supporting that 
the supernovae magnitude errors are Gaussianly dis- 
tributed and therefore the hypothesis (iii) and (iv) are 
quite strong. Moreover, the x^ statistic is highly suscep- 
tible to the presence of "outliers" in the datasets. This 
constitutes an extra concern, in particular due to the un- 
certainties associated to the calibration of the SNIa light 
curves. 



The fewer the assumptions one needs to make about 
the dataset, the more reliable the results derived from it 
are. The median statistical analysis discards the assump- 
tion of the Gaussianity of the errors, requiring only the 
use of hypothesis (i) and (ii). 

Assuming that the experimental results are statisti- 
cally independent and that there are no systematic errors 
present, one expects that after performing a sufficiently 
large number of measurements, approximately, half of the 
values obtained will be above (or below) the correct mean 
value. In particular, if we perform N measurements, the 
probability that k of them will be above (or below) the 
median is given by the binomial distribution. 
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This way, given the data set Pobs{zi) {i = 1--A^), the 
likelihood of the cosmological parameters Q is obtained 
by counting the number of observational values that fall 
above (or below) the curve given by Eq. ((IJ. 

Despite being associated with less tighter constraints 
than the x^ statistic, the median statistic is not very sen- 
sitive to the presence of a few "outlier" SNIa objects. In 
Q, it has been shown how the presence of one or very 
few "ill" data points could severely distort a x^ anal- 
ysis, while the median results remained approximately 
the same. 

On the other hand, one should notice that the likeli- 
hood computed with the median statistic only accounts 
for the number of experimental points above or below the 
expected correct curve, not differentiating between the 
various ways in which these points could be distributed. 
For instance, a set where the first half of the data points 
is above the expected curve and the second half is below 
has the same likelihood (when computed with the me- 
dian) as a set where the first point is above, the second 
below, the third above, the fourth below, and so forth. 
These two cases should not be indistinct since the first 
could turn out to be a terrible fit to the data. In [l3| 
it has been shown how some modifications of the me- 
dian statistic could alleviate this problem. In short, the 
modifications involved taking into account the size of the 
largest continuous sequence found above (or below) the 
model's prediction, or the number of total continuous se- 
quences obtained. 

Here, we propose an alternative way to cope with this 
problem. Instead of counting the number of points that 
are above (or below) the model's prediction in the entire 
redshift range of the data set, one may divide the dataset 
into redshift intervals with a certain number of SNIa ob- 
jects and perform the counting in each interval. Suppose 
we divide the dataset into n intervals, with Nj being the 
number of SNIa objects in the j'-th interval (j — l..n). 
This way, the overall likelihood of the parameters Q is 
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FIG. 1: 68.3%, 95.4% and 99.7% confidence level contours on 
QmO and JIao, obtained from the Union 2 SNIa dataset using 
the x^ statistical analysis. 



FIG. 3: 68.3%, 95.4% and 99.7% confidence level contours 
on QmO and w (assuming that flko = 0), obtained from the 
Union 2 SNIa dataset using the x^ statistical analysis. 
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FIG. 2: Same as Fig. [T]but using the median statistical anal- 
ysis. 




FIG. 4: Same as Fig. [3] but using the median statistical anal- 
ysis. 



given by 



P = l[P{k„N,), 



(7) 



j=i 



where kj is the number of points that, in the j'-th interval, 
are above (or below) the theoretical curve given by the 
parameters. This way, by properly dividing the dataset 
into groups of supernovae, we are more likely to avoid 
pathological situations in which very large sequences of 
SNIa above (or below) the median value are present. 



B. The CMB shift parameter 

The CMB shift parameter R is defined by 
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with the redshift of decoupling Zd^c ~ 1090.97 p, 
Il3j . Following the WMAP 7-year results Q we take 
Robs — 1.725 with an error an = 0.018. The likeli- 
hood derived from the shift parameter is then Pcmb oc 
exp(-XcMB/2), with 
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It turns out that assuming that R is Gaussianly dis- 
tributed around Robs with standard deviation an pro- 
vides an efficient summary of the information encoded in 
the full CMB data Ullil]. 
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FIG. 5: 68.3%, 95.4% and 99.7% confidence level contours 
on flmo and Hao, obtained from the Union 2 SNIa dataset, 
combined with constraints on the CMB shift parameter R, 
using the x^ statistical analysis. 



FIG. 7: 68.3%, 95.4% and 99.7% confidence level contours 
on QmO and w (assuming that flko = 0), obtained from the 
Union 2 SNIa dataset, combined with constraints on the CMB 
shift parameter R and the BAO scale, using the x^ statistical 
analysis. 
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FIG. 6: Same as Fig. [5] but using the median statistical anal- 
ysis for the SNIa data. 




FIG. 8: The same as Fig. [7] but using the median statistical 
analysis for the SNIa data. 



C. The BAO scale 

The baryonic acoustic oscillations imprinted in the 
CMB manifest themselves today in the large-scale dis- 
tribution of galaxies. The BAO signature on large scales 
was found in [l6[ , when a small "bump" in the two-point 
correlation function of red-luminous galaxies was mea- 
sured. Cosmological constraints arise via the position of 
the "bump" , which is related to the quantity 
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The likelihood is Pa oc exp (— x^/2), with 
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where zbao = 0.35, A^hs = 0.469 and ga = 0.017 [l7|, 
Il8j . Just like in the case of the shift parameter, it is 
usually safe to assume that A is Gaussianly distributed 



III. RESULTS 

A. SNIa constraints 

Following the discussion in Sec. IIA, we divide the 
SNIa Union 2 dataset (J], which containts a total of 
N = 307 SNIa objects, in groups of approximately 75 
SNIa each. The supernovae are ordered by increasing 
redshift with the first three groups having 76 supernovae 
and the forth group 79. Several other ways to divide the 
set are possible. However, this choice is not critical for 
our conclusions. 



Fit 


fimO 


fifcO 


w 


SNIa ix^) + CMB 


[0.254; 0.310] 


[-0.043; 0.056] 


-1 (fixed) 


SNIa (median) + CMB 


[0.260; 0.342] 


[-0.061; 0.083] 


-1 (fixed) 


SNIa (x^) + CMB + BAG 


[0.269; 0.313] 


[-0.031; 0.049] 


-1 (fixed) 


SNIa (median) + CMB + BAG 


[0.275; 0.333] 


[-0.042; 0.064] 


-1 (fixed) 


SNIa (x^) + CMB + BAG 


[0.264; 0.308] 


O(fixed) 


[-1.084; -0.955] 


SNIa (median) + CMB + BAG 


[0.266; 0.312] 


O(fixed) 


[-1.058; -0.902] 



TABLE I: Constraints on Q,mO, ^deo and w, obtained from SNIa (y^ and median analysis), CMB and BAG data, as well as 
their combinations. The constraints are at the la level. 



Figures[I]and[l]show the 68.3%, 95.4% and 99.7% con- 
fidence level contours on J7mo and JIao , obtained from the 
Union 2 SNIa dataset using the x^ and median statisti- 
cal analysis, respectively. Figures [3] and |4] are similar to 
Figs. [1] and [21 except that now the SNIa constraints are 
on the (r2mo,w) plane and the condition J7feo — was 
assumed. In both cases the Hubble parameter today was 
held fixed at Hq = 70.2km/s/Mpc 5]. As expected, the 
constraints using the x^ statistic are tighter, consequence 
of the stronger assumptions it makes about the data. 
Nevertheless, the constraints obtained using the median 
statistical analysis are not as bad as one could originally 
fear. The median contours are more "stretched" than the 
X^ ones, but their "width" is similar. The median statis- 
tical constraints are in principle more reliable, since they 
do not assume Gaussianity of the SNIa magnitude error 
distribution. 



BAO data. Again, we see that the combined constraints 
are weakly dependent on whether one derives the con- 
straints from SNIa with the x^ or with the median sta- 
tistical analysis. 

Table [J presents a summary of the constraints, at the 
1(7 level, obtained from different data combinations and 
model assumptions. It shows that the median is almost 
as constraining as the x^ statistical analysis, if com- 
bined with additional CMB and BAO constraints. Hence, 
the strong assumption that the SNIa measurements are 
Gaussianly distributed is not necessary in order to ob- 
tain tight constraints. On the other hand, since the x^ 
and median statistical analysis yield similar cosmological 
constraints when the SNIa data is combined with other 
cosmological probes, our results may also be interpreted 
as providing additional validation of the results obtained 
using the standard x^ statistical analysis. 



B. SNIa + CMB + BAO constraints 

FiguresOandilshow the 68.3%, 95.4% and 99.7% con- 
fidence level contours on ^mQ and 17 aq , obtained from the 
Union 2 SNIa dataset (using the x^ and median statis- 
tical analysis, respectively), combined with constraints 
on the CMB shift parameter R. The differences between 
the results obtained with the x^ and median statistics 
are now significantly reduced. This is related to the fact 
that, as shown in Figs. [T] and [21 the "width" of the 
SNIa contours, obtained using the x^ and median analy- 
sis, is very similar. The CMB contours "cross" the SNIa 
contours, rendering similar combined constraints in both 
cases. Considering the BAO data (not shown) tightens 
the constraints slightly (see Table 1). 

Figures [7] and [8] show the combined constraints on Vtmo 
and w (assuming that fi/jQ — 0), obtained from SNIa 
(using x^ and median analysis, respectively), CMB and 



IV. CONCLUSION 

In this paper we compared the cosmological constraints 
derived from the Union 2 SNIa dataset using x^ and 
median statistics. In the absence of CMB and BAO 
constraints, we have shown that the x^ statistic yields 
tighter cosmological constraints than the median statis- 
tic, as a result of the stronger assumptions it makes 
about the SNIa magnitude error distribution. On the 
other hand, when CMB and BAO information is taken 
into account, the performances of both statistics are very 
similar. Hence, we conclude that the assumption of the 
gaussian distribution of the errors in the SNIa analysis 
does not appear to be critical in the determination of 
cosmological constraints, provided that complementary 
information from other cosmological probes is also taken 
into account. 
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